Voiced Excitation Models for Speech Production Based on Time Variable Volterra Systems

Authors

  • Karl Schnell
  • Arild Lacroix
Abstract

Speech production can be modeled by linear and nonlinear systems. In this contribution, a time variable nonlinear Volterra system models the fluctuations of the voiced excitation, while a linear system models the resonances of the speech production system. The Volterra system is estimated by a prediction algorithm, which is made possible by describing the prediction problem as an approximation by a series expansion. Speech examples show that using a time variable Volterra system improves the naturalness of the synthetic speech.
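The abstract does not spell out the prediction algorithm, so the following is only a minimal sketch of the general idea it names: a Volterra predictor is linear in its kernel coefficients, so fitting it can be posed as a least-squares approximation by a series expansion over past samples and their products. Everything here is an assumption for illustration: the function names `volterra_regressors` and `fit_predictor`, the second-order truncation, the memory lengths, the synthetic test signal, and above all the use of fixed (time-invariant) kernels over a single frame. It is not the authors' time variable method.

```python
import numpy as np


def volterra_regressors(x, n_lin, n_quad):
    """Build the regression matrix of a second-order Volterra predictor.

    The row for time n holds x[n-1], ..., x[n-n_lin], followed by the
    products x[n-i] * x[n-j] for 1 <= i <= j <= n_quad.
    """
    start = max(n_lin, n_quad)
    rows = []
    for n in range(start, len(x)):
        linear = [x[n - i] for i in range(1, n_lin + 1)]
        quadratic = [x[n - i] * x[n - j]
                     for i in range(1, n_quad + 1)
                     for j in range(i, n_quad + 1)]
        rows.append(linear + quadratic)
    return np.array(rows), x[start:]


def fit_predictor(x, n_lin, n_quad):
    """Least-squares fit of the kernel coefficients; returns (coeffs, residual)."""
    A, target = volterra_regressors(x, n_lin, n_quad)
    coeffs, *_ = np.linalg.lstsq(A, target, rcond=None)
    return coeffs, target - A @ coeffs


# Hypothetical test signal: a weakly nonlinear recursion driven by bounded noise.
rng = np.random.default_rng(0)
e = rng.uniform(-0.5, 0.5, size=5000)
x = np.zeros_like(e)
for n in range(2, len(x)):
    x[n] = 0.4 * x[n - 1] - 0.2 * x[n - 2] + 0.05 * x[n - 1] * x[n - 2] + e[n]

# Fit a Volterra predictor and, for comparison, a purely linear one.
coeffs, res_volterra = fit_predictor(x, n_lin=2, n_quad=2)
_, res_linear = fit_predictor(x, n_lin=2, n_quad=0)

print("kernel coefficients:", coeffs)
print("residual energy (Volterra):", np.sum(res_volterra ** 2))
print("residual energy (linear):  ", np.sum(res_linear ** 2))
```

Because the linear predictor is a special case of the Volterra predictor on the same samples, the Volterra residual energy can never exceed the linear one in this fit. A time variable variant would additionally let the coefficients change over time, for example by refitting per frame, but how the paper does this is not stated in the abstract.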

Related resources

Time-Order Representation Based Method for Epoch Detection from Speech Signals

Epochs in voiced speech are defined as the time instants of significant excitation of the vocal tract system during speech production. The nonstationary nature of the excitation source and the vocal tract system makes accurate identification of epochs a difficult task. Most existing methods for epoch detection require prior knowledge of voiced regions and a rough estimation of pitch f...

Efficient mixed excitation models in LPC based prototype interpolation speech coders

This paper presents a new and efficient method for modelling voiced, mixed excitation spectra in Sinusoidal (SC) and Prototype Interpolation Coding (PIC) systems. Speech harmonics are classified as “weak-voiced” or “strong-voiced” by simply examining the short-term residual magnitude spectrum. This information is encoded effectively in terms of fixed width frequency bands and is used to control...

Speech enhancement using voice source models

Autoregressive (AR) models have been shown to be effective models of the speech signal. However, although it is the most common model of speech, an AR process excited by white noise, when used for speech enhancement, fails to capture the effects of source excitation, especially the quasi-periodic nature of voiced speech. Speech synthesis researchers have long recognized this problem and have developed a variet...

Uniform concatenative excitation model for synthesising speech without voiced/unvoiced classification

In general, speech synthesis using the source-filter model of speech production requires the classification of speech into two classes (voiced and unvoiced) which is prone to errors. For voiced speech, the input of the synthesis filter is an approximately periodic excitation, whereas it is a noise signal for unvoiced. This paper proposes an excitation model which can be used to synthesise both ...

Multipulse Sequences for Residual Signal Modeling

In source-filter models of speech production, the residual signal, which is what remains after passing the speech signal through the inverse filter, contains important information for generating natural-sounding re-synthesized speech. Typically, the voiced regions of residual signals are regarded as a mixture of glottal pulse and noise. This paper introduces a novel approach to represent the nois...

Publication date: 2005